The Pronouncing Dictionary of Austrian German (AGPD) and the Austrian Phonetic Database (ADABA): Report on a large Phonetic Resources Database of the three Major Varieties of German
نویسنده
چکیده
The paper gives a comprehensive overview over the results, the concepts and the methods which were developed and used to create the Pronouncing Dictionary of Austrian German (ÖAWB/AGPD) and the Austrian Pronouncing Database ADABA. The AGPD contains 42.000 entries which are based on a large audio corpus of 75.964 realisations of two model speakers each from Austria, Germany and Switzerland. The ADABA database provides 9 different ways to search the data. It also contains 24 model texts and another 30 texts showing linguistic and phonetic variation in Austria and in the other German speaking countries. The codification of Austrian standard pronunciation was based on the concept of German as a pluricentric language and on the concept of "media presentation language". Austrian pronunciation forms are presented in parallel with those of Germany and Switzerland to allow the comparison of differences between linguistically close national varieties of a language. The paper also gives a detailed characterisation of the software (transcriber, database) which was developed during the project that was supported by the Austrian national broadcasting corporation ORF and the University for Music and Dramatic Arts in Graz. Some of the software and the data can be obtained from the web site
منابع مشابه
The Pronouncing Dictionary of Austrian German and the other Major Varieties of German - A Phonetic Resources Database on the Pronunciation of German
The paper gives a comprehensive overview on the project “Varieties of Austrian German Standard pronunciation and varieties of standard pronunciation” whose primary goal is the creation of a pronouncing dictionary of Austrian German and the creation of a large data base of audio samples for research on spoken language and different forms of pronunciation in Austria. The contents of the dictionar...
متن کاملPronunciation variation in read and conversational austrian German
This paper presents the first large-scale analysis of pronunciation variation in conversational Austrian German. Whereas for the varieties of German spoken in Germany, conversational speech has been given noticeable attention in the fields of linguistics and automatic speech recognition, for conversational Austrian there is a lack in speech resources and tools as well as linguistic and phonetic...
متن کاملA Phonetic Lexicon for Adaptation in ASR for Austrian German
We present a phonetic lexicon for Austrian German, which was generated automatically from the canonic version of a German pronunciation dictionary. The lexicon is based on narrow transcription in Sam-Pa. Both the speech files and the canonic dictionary are taken from the SpeechDat-AT database. Since the recorded items are mainly read speech the differences between the canonic form and the real ...
متن کاملA Comparative Study of Intonation in Three Standard Varieties of German
This paper presents a comparative analysis of declarative intonation produced by standard speakers of German from Austria, Germany and Switzerland. The analysis was based on a directly comparable corpus of speech data. A perception test with phoneticians from the three countries suggested (1) that speakers from the three varieties produce different tunes on accented syllables, and (2) that ther...
متن کاملText-to-Speech Engine with Austrian German Corpus
This paper deals with developing a unit selection speech corpus for the Austrian variety of German by (re)using the resources for German and adapt them to Austrian German. This means adaptation on different levels such as lexicon level, phone level, or speech data level, whereas a compromise between reusing the given resource and an exact time-consuming phonetic transcription has to be found. I...
متن کامل